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ABSTRACT 

One can measure the readability of any piece of 
reading material for an individual or a group by having the selection 
read and then testing for comprehension. Increasingly the cloze 
procedure has come to be used in preference to multiple-choice 
questions. Other readability scales have been based on previously 
scaled passages, on carefully graded books, and on the combined 
judgments of a group of experts. Two variables have consistently 
stood out as providing the best combination in the measurement of 
readability. The first is the difficulty of the vocabulary used, 
which is usually measured by finding the percentage of words that do 
not appear in a specific list of common, easy words. Spelling 
patterns as indices of vocabulary difficulty have just begun to be 
explored and seem promising. The second widely used variable is 
average sentence length, which seems to represent the many specific 
reasons why beginnings have been made in the automatic computer 
scoring of complicated linguistic variables such as syntactic depth 
and density. The specific factors that make some reading material 
hard to understand, such as vagueness, ambiguity, and lack of 
explicitness, are areas in which more research is needed. 
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SOME NEW DEVELOFMENTS OH READABILITY 



(Presented at the Fifth IRA Vforld Congress on Reading, 
Vienna, Austria, August 13> 197^) 



One of the major tasks for the teacher of reading is to provide 
the student reader with material that he can read and understand. The 
tern r* ^adability refers to the qualities of reading material which 
determine how easy it is to understand, how fluently it can be read, 
and how much interest it generates. This paper, however, is concerned 
only with readability in the sense of ease or difficulty of comprehension. 

The teim ••new development?* needs to be made more explicit. Two fairly 
recent publications provide a starting point. One is a revised annotated 
bibliography published by IRA in 1971 entitled Readability and Reading ^ 
COTipiled by Seels and Dole. This provides good coverage of research 
published between I965 and 1970. The other is John GiUiland^s Readability ^ 
a Teaching of Reading Monograph of the U.K.R.A., published in 1972. The 
present paper emphasizes developments too recent to have been covered in 
those two very helpful sources. Some earlier work will also be discussed. 

On first thought it may seem that the logical way to 'determine the 
readability of a selection is to give it to someone to read and to check 
his comprehension. Such a procedure, sometimes called "trying the book on . 
for siae,** can establish how well that person has been able to read It. 
But usually we want to predict how a selection, or a yhole book, will fit 
an individual or group* s ability to \inderstand before deciding if they 
should be asked to read it. 

The Measurement of Readability 

To get away from the need to try a book with each potential user it is 
necessary to obtain a score which can express the book's readability on 
an easily intearpreted scale. The way this is done is to measure its 
scores on characteristics which can predict ^Aiere it would belong on a 
scale of reading se] ections whose readability scores have already been 
established. 

The scale of reading selections that has been used most widely in 
readability research is a collection called Standard Test Lessons in Reading 
by McCall and Crabbs, originally pablished in the 1920 's and most recently 
revised in I96I. There are five booklets | each containing about 70 one-^^age 
test lessons. For each lesson there is a short selection, followed by 
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about ten multiple-choice questions. At the bottom is a table giving 
the pirade score corresponding to each possible number of right answers. 
These grade scores were originally established by giving the test lessons 
and a standardized reading test to several thousand pupils. The read- 
ability fomulas developed by Lorge^i Flesch, and Dale and Chall all used 
the McCall-Crabbs test lessons as a criterion. 

Another way to set up a readability scale is to use the cloze 
technique to determine the comprehension difficulty of a number of 
selections. In the cloze procedure, words are deleted in a regular 
pattern and the reader's task is to write in each missing word. Usually 
every fifth word is deleted. The average number of correct cloze answers 
for a group can be used to rank selections in readability, setting up a 
scale against which other selections can be compared. The cloze technique 
has been used in much recent readability research, particularly by 
Bomuth (1968, 1969) and by Coleman (1968a, 1968b, 1971) • 

A third procedure is to take several series of carefully graded 
books and use their average characteristics to establish a graded scale. 
In the United States, widely used basal reader series have been used as 
criteria for the development of readability formulas by Spache and in 
the new Harris-Jacobson formulas which will be described a little later in 
this paper, 

A fourth procedure is to have the selections rated by a group of 
judges, using the average rating to establish a scale. However, the 
ratings of judges on the same book can vHry widely, Jongsma ( 1972) had 
12 Newbery Award-winning books rated by 44 school and public librarians. 
One book was rated all the way from third grade to twelfth grade in 
difficulty, 

Elements of Readability 

Mmerous studies have shown that the two main elements of the 
difficulty of reading material are the difficulty of the vocabalary used, 
or Semitic difficulty, and the difficulty of the sentence structure, 
in other words, its syntactical or grammatical complexity. Of these two 
factors, vocabulary difficulty has consistently shown somewhat greater 
importance, Klare (I968) wrote: ••Frequency of occurrence of words , , , 
clearly plays an all-^rvasive role in language usage. Not only do humans 
tend to use some words much more often than others, they recognize more 
frequent words more rapidly than less frequent, prefer them, and under- 
stand and learn them more readily. It is not surprising, therefore, that 
this variable has such a central role in the measurement of readability, 

Measurement of the difficulty of words has been done in a variety of 
ways. The procedure most often used is to determine what per cent of 
words in a sample are not found in a particular list of common words. 
Two lists compiled by Edgar Dale, a short list of 769 words and a list 
of about 3,000 words known to fourth grade children, have been used in 
several readability formulas including those by Lorge (1944), Dale and 
Chall (1948), and Spache (1953). Recently there have been efforts to 



A. J. Harris 3 



make these lists more up-to-date. Spache (197^) expanded the original short 
Dale list to IfOkl viords» adding words from the Harris-Jacobson Gore L^st 
(1972) and from library books of first and second grade levels, and revised 
his readability formula accordingly. 

The publication in 1972 of the Harris-Jacobson Basic Elementary Reading 
VocAbularies provided a possible improvement in the measurement of vocabiilary 
difficulty. The Harris-Jacobson word lists are based on a computerized 
analysis of about 4,500,000 running words found in 14 series of elementary 
school textbooks^ from beginning first grade through sixth grade. A first 
grade list, a combined first and second grade list, and a combined first, 
secondp and third grade list were tried out, and the combined first and 
second grade list proved to have the highest correlation with reading level 
from grade one through grade six. This combined list contains 912 root 
words and 1880 inflected forms, such as plurals and regular verb endings 
such as -ed and -ing t totalling 2,796 words. Since all allowable variants 
of a common root words are right in the list, qne does not have to remember 
a variety of rules as to what constitutes an unfamiliar word, or repeatedly 
consult the list of rules. If a word is not in the list and is not a proper 
noun it co\ints as unfamiliar. The percent of unfamiliar words has the 
highest correlation with basal reader level of a large nmber of measures 
of vocabulary and sentence difficulty tried out. 

Several other measures of vocabulary difficxilty have been used in one 
or more of the more than fifty different readability fomulas that have been 
published to date. Average number of letters per word, average number of 
syllables, average number of vowel letters, number of prefixes and s\if fixes, 
per cent of one-syllable words, per cent of words having three or moi^ 
syllables, and per cent of words having more than five letters have been 
tried. Studies have also shown that the proportion of certain parts of 
speech is related to readability* Coleman (1971)> for example, used 
number of pronouns and number of prepositions per 100 words as ininor 
elements in his readability formulas. 

All of these different ways of getting at vocabulary difficialty have 
substantial to high inter-correlations, but not all are of equal value 
in measuring readability. Unpublished results by Harris and Jacobson 
show that over the range from beginning first grade through sixth grade, 
per cent of uni'amiliar words had the highest correlation, .87; per cent 
of words with more than five letters, .80; and average number of letters 
per word, .74. The per cent of unfamiliar wor^s was 21 per cent more 
accurate in measuring readability than average nmber of letters per word. 
In recent years ways have developed for scoring many of these variables 
by computer. A computer can easily detenrdne whether a word is or is not 
in a given list, count its letters, and so on, but it is much harder to 
write a program which will accurately identify syllables or prefixes. 
In -the future only variables lAich can be scored by computer are likely 
to be retained in readability reseatch^ 
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In 1973 Harris and Jacobson reported that the per cent of words beginning 
with the letter e has a substantial correlation with difficulty in primary 
redding material, and Jacobson followed this up by identifying more than 
1,000 spelling patterns in English words and correlating then with difficulty. 
In a 197^ papjer Japbson has reported thay37 spelling patterns, when 
combined in a multiple regression equation, correlated .92 with primary 
readLng difficiilty. In further work not yet published he has found that 
whon one group of 12 spelling patterns is used for primary material and 
anohher group of 12 spelling pattaiDs is used for middle*grade material, 
a combined multiple correlation of^.965 was obtained. Using these spelling 
S^aSfififc is possible only with a computer and a very complicated computer 
program. Spelling patterns are probably related to readability in two 
different ways. One is that certain spelling patterns have greatly 
variable sound-symbol relationships. Initial e, for example, can represent 
at least seven different phonemes, as in each t ear, early t elephant t 
Bnglish t eight y and eyesight . This obviously increases the difficulty of 
decoding words beginning with e. The other way is that some spelling 
patterns tend to occur mainly in long words which are also uncommon and 
difficxilt, while other patterns appear predominantly in short, comon, easy 
words. Spelling patterns present a new and promising approach to the 
measurement of vocabulary difficulty, but are usable only by those Ulxo 
have appropriate computer resources* 

Sentence Difficulty 

From the beginning of readability study is has been recognized that 
the difficulty of a sentence involves elements beyond the difficulties 
of the words in it. The average number of words per sentence has been 
used in many readability formtilas and provides a reasonably tsatisfactoxy 
measure of those sentence characteristics that influence readability. 
In general, long sentences tend to have more modifiers wd qualifiersf 
more embedded phrases and clauses, and complex riither than simple 
structure. 

Since I96O a number of efforts have been made to mecaure the linguistic 
or syntactic diffictilty of sentences. Hunt (I965) developed a measure 
called the T-unit, vdiich is one main clause and its related words, phrases, 
and dauseci; in most cases, this amount to the number of words in the 
sentence. Golub (I969) developed a Syntactic Density Score vAich uses 
the T-unit and also takes account of such items as complex verb expansions 
and prepositional phrases. Since then Golub has developed a program 
which can obtain his Syntactic Density Score by computer(unpublished)# 
Bonnuth (196^) included measures of right depth of sentences and left 
depth of sentences among the variables used in some of his readability 
fonnulas. Botel and Granowsky (1972) developed a formula for measuring 
the syntactic density of individual sentences; the average score for the 
sentences in a selection should correlate with readability. 

Such methods of sentence analysis obviously help to show why one 
sentence may be harder to understand than another of similar length. 
But there is no evidence as yet that they can give a better indication 
of the difficulty of a ^Aiole selection than average sentence length does. 
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MacGinitie and Tretiak (1972) compared the sentence depth measures devised 
by Yngve (I96O) and by Allen (I968) with average sentence length and 
found that average sentence length gave the best prediction of readability 
when each was combined with a measure of word diffic\ilty» Similar research 
is needed for the other measures of sentence complexity. Since scoring 
sentences for these variables is slow and laborious, future research in 
this area is likely to emphasize variables that can be scored by computer* 

To sum up this discussion on the elements of compreliensibility or 
readability, the most important element in the difficulty of reading 
material is the difficulty of the vocabulary employed. This is measuraable 
in a variety of ways, among ^riiich the per cent of words not found in a 
list of common, easy words seems to be the most satisfactory as well as 
the one most commonly used. The other main variable affecting readability 
is sentence complexity, which is well represented by average sentence 
length. 

New Readability Fomulas 

The increasing use of computers in readability research has made it 
possible to develop readability formulas for special purposes with ease. 
However, most users of readability measures do not have c<»nputers avail atile, 
and ease and speed of scoring and computation are important factors in 
the choice of \^ch formula to use. This di.scussion will be confined to 
new formulas that are applicable by hand to children's materials for the 
elementary grades. 

At the Denver Convention of IRA in May, 1973» Harris and Jacobson presented 
some data on new readability formidas. Since then the formxxlas have been 
further revised and a large number of new formulas have been tried out.. 
Many different combinations of variables provide measures of about equal 
accuracy. Two formulas, one for grades one through three, and one which 
works from grade one through grade six but is mainly for use above third 
grade, have bi^en selected as providing the best combii. ition of ease and 
rapidity of application and high correlations with difficulty. Both 
fomulas have correlations of .90 with reader level. The primary formula 
has a standard error of estimate of .38 of a year; the other formula has 
a standard error of a71 of a year. Complete directions for using these 
formulas will appear in a book scheduled for publication early in 1975 
(Harris and Sipay, in press). ^ 

Spache {197^) published a revision of his formula for primary-grade 
material. The new formula is based on 100 samples of about 100 words 
each, or a total of about 10,000 words, in comparison to the Harris- 
Jacobson formulas which are based on 66 1 samples totalling about 135^000 
words. Spache reports the very high correlation of .95 with book grade 
level. 

Specific Features of Difficult Prose 

The kinds of conjunctions used by writers seems to influence the 
difficulty of their material. Stoodt (1972) reported on the comparative 
difficulty of ooimnonly used conjunctioijs for fourth grade children. 
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The ones that were best understood were and, for, and as# The follovdng 
were comparatively difficult: when, so, but, or, where, while t that, and if , 
Robertson (I968) found that clauses introduced by however, thus, wfaich ^ 
although t and yet were difficult for children in grades 4-6, and success 
in understanding such clauses was closely related to general reading 
comprehension, 

Rosenshine (1969) used long passages ^rfiich were equivalent in difficulty 
according for readability formulas but on \diich college students showed 
varying degrees of comprehension. He found five factors that tended to 
influence the readability of the passages, Difficiilty was increased by 
vagueness and ambiguity, vAiich is shown by the excessive use of indeter- 
minate qualifiers such as rather, any number of t quite a bit « etc,, 
and also by excessive use of probability words such as might y possibly , 
and sometimes, A factor aiding comprehension is frequent use of explain- 
ing links like because , in order to « if,,, then ^ and so forth, which call 
attention to a cause, a result, or a means, A third factor was frequent 
use of examples, which seemed more important in difficiilt material than in 
easy material. The fourth factor was use of a rule-example-rule pattern, 
in which a generalization would be stated, followed by one or more examples, 
and then by a restatement of the generalization or rule. This seemed more 
beneficial than either an inductive or deductive pattern of presentation. 
The fifth factor was irrelevancy, irfiich increased the difficulty of the 
material. Digressions and unnecessary restatements seem to lower the 
amount of information gain. These five factors are worth noting by those 
who write and lecture. 

We have only a beginning o£ raeearch on the specific features of prose 
writing that make material easy or difficult to read. We need to distinguish 
between inherent difficulty that resull.ts from the necessary use of concepts 
and relationships that are hard to explain and hard to iinderstand, and 
unnecessary difficulty vriiich is created by distinctive features of an . 
author* s style, Wen the subject-matter is inherently difficult, one can 
lower a readability formula score artificially by chopping long sentences 
into short ones and by substituting more common for less common synonyms, 
Geyer and Carey (1972) rewrote American History materials so as to reduce 
the Dale-Chall readability scores by about two grades. However, the 
comprehension scores of students were no higher on the easier than on the 
hirder original versions. Apparently the kind of rewriting that was 
pruided by the formula did not reduce the inherent difficulty of the 
content. 

All too often, an author creates unnecessary difficoolty for his readers. 
In addition to the factors of vagueness and irrelevancy discussed by 
Rosenshine, we have to guard against excessive use of the passive voice 
and the subjunctive mood. We have to check to see if modifying phrases 
and clauses are placed close to the items they modify. We have to note 
whether or not the antecedents of pronoims are easily identified. We 
have to observe whether sentences follow in a logical sequence, whether 
each paragraph actually has a main idea, Aether appropriate emphasis is 
given to the most important statements. These are all factors that are 
not measured by readability formulas, but which do affect the ease or 
difficulty with which one can grasp the thought content of written material. 
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Readability of Larguages Other Than English 

This paper has considered only readability research on English prose. 
Readability studies have been conducted on quite a number of other languages, 
including French, German, Dutch, Spanish, Hebrew, Hindi, Chinese, Korean, 
Japanese, and Vietnamese. The limitations of this paper do not permit 
going into detail on these studies, most of which were done before 1970. 
Professor G. D^andsheere of the University of Liege has been a leader in 
European research on readability. Very briefly, it seems safe to generalize 
that the two variables of vocabulary difficulty and sentence length are 
useful indicators of comprehensibility in any language. The features of 
specific written languages may encourage use of such variables as syllable 
counts or measures of the visual complexity of Chinese characters. 

Summary 

One can measure the readability of any piece of reading material for an 
individual or a group by having the selection read and testing for comprehension. 
Increasingly the cloze procedure has come to be used in preference to multiple- 
choice questions. A scale of selections can be arranged based on the average 
comprehension scox^s of a group. Other readability scales have been based 
on previously scaled passages, on carefully graded books, and on the combined 
judgments of a group of experts. Once a scale has been constructed, many 
different characteristics of the material can be measured, and each set of 
measurements can be correlated with the scale. Using the technique of 
multiple correlation, the best combination of variables and the best ways 
to weight them so as to get a maximum correlation can be discovered. 
Computer procedures which can score the selections for many variables as well 
as compute multiple correlations almost instantly have greatly simplified and 
speeded up readability research in recent years. 

Consistently two variables have stood out as providing the best combination 
in the measurement of readability. The first, and more important, is the 
difficulty of the vocabulary used. This is usually measured by finding the 
per cent of words that do not appear in a specific list of coninon, easy words. 
Spelling patterns as indices of vocabulary difficulty have just begun to be 
explored and seem promising. The second widely used variable is average 
sentence length, irfiich seems to represent the many specific reasons why 
long sentences are usually harder to understand than short ones. Now that 
beginnings have been made in the automatic computer scoring of complicated 
linguistic variables such as syntactic depth and density, more refined 
measures of sentence difficulty may appear in future readability formulas<. 
Recent formulas that -use the two variables of per cent of unfamiliar words 
and average number of words per sentence have mtiltiple correlations of .90 
or better with difficulty, seeming to Indicate that ttOt much further improve- 
ment is possible* 

The specific factors that make some reading material hard to oinderstand 
have begun to be analjrzed. Vagueness, ambiguity, and lack of explicltness 
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concerning important relationships are among the factors noted thus 
far. Much more research is needed in this area. 

Most readability research has been conducted on English prose. Some work 
has been done in a number of other languages, both European and non- 
European, which lies outside the scope of this paper. 

Progress in the measurement and understanding of readability will 
improve the ability of authors and teachers to achieve a better fit 
between the abilities of readers to understand and the materials they 
are expected to read. 
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